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Abstract 

We study a scenario under which variable step random walks give anomalous statistics. We 
begin by analyzing the Martingale Central Limit Theorem to find a sufficient condition for the 
limit distribution to be non-Gaussian. We note that the theorem implies that the scaling index ^ 
is ^. For corresponding continuous time processes, it is shown that the probability density function 
W{x;t) satisfies the Fokker-Planck equation. Possible forms for the diffusion coefficient are given, 
and related to W{x,t). Finally, we show how a time-series can be used to distinguish between 
these variable diffusion processes and Levy dynamics. 



1 



I. INTRODUCTION 



Under which conditions can statistics of stochastic processes be anomalous? Such statis- 
tics have been observed in temperature and longitudinal velocity fluctuations in highly turbu- 



m 




instantaneous velocities of gusting winds [6] and price variations 
. Furthermore, in the case of flnancial markets, it has 



lent fluid flows 
in flnancial markets 

been noted that the probability density functions exhibit self-similarity 



Levy statistics 



13 



l4 list hierarchical processes jl6| have been suggested as possible 



m 



In prior work, 



causes of anomalous behavior. In this paper we suggest an alternative scenario where, in 
contrast to Levi processes, stochastic increments have uniformly bounded variance and are 
not independent. 

General conditions for the validity of the Central Limit Theorem (CLT) are given for 
martingales, which are deflned in Section |n] We discuss how the limit distribution can 
fail to be Gaussian, and provide a set of conditions that give non-Gaussian statistics. In 
Section ITTTj we argue that the corresponding probab ility density W{x;t) for continuous time 



ibabiiitv 



processes satisfles the Fokker-Planck equation and scales with index ({= i); i.e., 

W{x;t) = -^F{u), where u = By using the Fokker-Planck equation, we show that the 
diffusion coefficient D{x; t) for the process takes a specific form. It is shown that a reduction 
of D{x;t) to a form D{u) preserves all statistical features of the stochastic process. Given 
D{u), we then provide an explicit expression for F{u). As examples, we provide forms 
for D{u) that give exponential and power-law distributions for F{u). In Section HV1 we 
provide a criterion that can be used to distinguish these newly introduced variable diffusion 
processes from Levy statistics. Previous analysis of fluctuations in flnancial markets appear 
to contradict the Levy mechanism. 

Throughout the paper, we will relate our results to turbulent flows and flnancial markets 
to illustrate implications of our assertions. However, it should be emphasized that our work 
is a theory of neither of these systems. 
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II. MARTINGALE CENTRAL LIMIT THEOREM 



The classical CLT in the context of identical independently distributed events {sk} with 
zero mean and variance o"^, states that 



n 
k=l 



ek^MiO,a') (1) 

as n ^ oo, where A/'(0, cr^) denotes a zero-mean normal distribution. Here the convergence 
is in distribution; i.e., for each a, 

lim P ( -^Ysk <a] = — ^ r e-"'/2"'da;. (2) 



The CLT can be generalized for a class of non-independent processes, referred to as martin- 
gales. We describe the setup only for the case of interest to us. 

We consider a random walk (on the real line) starting at xq = 0, with steps denoted by 
£ = (ei, 62, . . . ). The step Ek can depend on its history {i.e., the previous {k — 1) steps). 
The position after n steps is denoted by x„ = Ylk=i ^k- A probability measure p is given on 
the space of the infinite sequences; p induces a measure pn(s*-"'^) on the space of first n steps 
£:("■). When the context is clear, we denote p„ by p. If the random variables Sk are discrete, 
then pn includes 5-functions. 

The conditional probability of the k*^ step given the history e^'^"^^ is defined by 

The random variables {x„}„>o form a martingale if each increment (or martingale difference) 
Ek has zero conditional mean: i.e., if for each k > 1 

E[ek\e^'-'^=Q, (4) 

for all histories e^''^^\ (Note that, although this mean value is independent of the history 
of the walk, the conditional probability density given in Eqn. Q can depend on s^''~^\) 
Finally define the expected value of the location over all n-step random walks by 

E [X„] = j Xn dp (£(")) = j Xn P (s^")) , (5) 

where the second formula is written just to emphasize the variables over which the integration 
takes place. Denote the corresponding variance by Var We have the following Lemma 
for martingale processes. 

Lemma: If {xn} is a martingale process with xq = 0, then 



(I) E [Xn] = 0. 

(II) Var [xn] = ELi ^"-^ [^fc]- 
These results can be proved inductively using 

j &„+ip(£„+i|£(")) =1, (6) 

and 

J den+lSn+lP = E [Sn+M"'^] = 0. (7) 

For martingales, Theorem 3.2 of Ref. gives a more general form of the CLT. Recall 
that a sequence yn of random variables is said to converge in probability to a random variable 
y if for any 6 > 0, the probability of |?/„, — y| > 5 goes to zero as n ^ oo. 
Theorem (Martingale Central Limit Theorem) Suppose that ei, £2, • • • are square-integrable 
martingale differences such that 

(1) maxi<fc<„ {\ek\/^/n) ^ in probability, 

(2) X]fc=i^fc/^ ~^ probability, 

(3) E [maxi<fc<„ (el/n)] is bounded in n. 



where the random variable rj is finite with measure 1. Then 

[sk^Z, (8) 



n 
k=l 

where the convergence is in distribution (see Eqn. Q), and the random variable Z has 
characteristic function [i.e., E [exp{itZ)]) given by 

1 



E [exp{itZ)] = E 



exp^-^rft^) 



for all t. (9) 



Observe that the martingale differences Sk are not required to be independent or to 
be distributed identically. However, when the conditions of the theorem are satisfied, the 
distribution of the random variable = converges to E{u), the distribution of Z. We 
will refer to this property as scalability with scaling index C = |- 

We first provide a necessary and sufficient condition to obtain Gaussian statistics. 
Lemma: If the random variables Z and rj satisfy Eqn. 0, then Z is Gaussian if and only 
if r/^ is a constant. 

Indeed, if r]"^ is constant, say cr^, then Z has characteristic function exp 
therefore it is normally distributed with variance a^. Conversely, if Z has mean zero and 



is a Gaussian with variance o"^, then its characteristic function equals exp {—t^ a"^ / 2) . Write 
s = g = rf, and define the probabihty measure dP = e"^ e~^dP. Then all moments of 
g exist with respect to P. The equality 

implies, upon differentiating with respect to s and setting s = 1, that E[g^] = cr'^^ for all n. 
Thus is constant (Theorem 3.11 of Ref. jisl]). 

Next, we identify a set of conditions that gives anomalous statistics for m„. Condition (3) 
of the martingale CLT is satisfied if increments Sk have a variance bounded uniformly in n 
{i.e., there exists a c > such that for all k, Var [sk] = E [el] < c). To see this, note that 



E 



max 

l<A;<n 



{el/n) 



< E 



.k=l 



1 

EVar [Ek] < c. 



n 

k=l 



where the equality follows from the previous Lemma. Condition (1) is satisfied under the 
stronger property that there exists 5 > and Ci > such that for all k 

E[\ek?+'] < ci. (10) 

This can be seen from 

Proh I max {\ek\/Vn) > /S) < ^Proh {\ek\/Vn > f3) = ^Proh {\ek\ > Vn(3) 

^ " ^ k=l k=l 

and the fact that 

ci > E[\ek\'^'] > iV^PY+'Prob {\ek\ > V^P) . 
Therefore Prob{\6k\ > ^/nP) < Ci/^/n , which implies that 



Proh ( max (\6k\/\/n) > /? ) < n — 



as n — >• OO. 

What remains is to find martingales that satisfy condition (2) where r]^ is not a constant. 
If efc's are independent and identically distributed with finite variance a^, then from the 
classical CLT 77^ = cr^ in probability. Once e^'s are allowed to be history dependent, the 
conditions for convergence of {^^jn) becomes non-trivial, as illustrated by the following 
example: consider a stochastic process with independent steps, consisting of steps from 
a distribution with finite variance a\, followed by M\ steps from a distribution with finite 
variance ct|, followed by steps from the first process, M2 steps from the second, etc. For 



suitable choices of A^i -C Mi N2 M2 <^ . . ., rf moves between al and a^, and fails to 
converge. Convergence of requires more stringent conditions on the stochastic process. 

For processes introduced in the next section, the distribution of rf is not a (5-function, as 
shown in the Appendix. 

We conclude this section with the following observations. First, we reiterate that once 
the conditions of the martingale CLT are satisfied, Un = a;„/ y/n converges to a distribution 
F{u)] i.e., the scaling index ( is |. Second, in contrast to Levy processes, increments Ek 
are not independent. Further, for the examples we consider, the e^'s satisfy Eqn. ()10|) (at 
least, according to the numerical simulations). Note however that the conditional variance, 
Var [en+i\s^'^^~\ , is not required to be uniformly bounded. 



III. CONTINUOUS MARKOV PROCESSES 



In order to study continuous processes, divide the interval t into subintervals of 6t and 
let n = t/6t; it is assumed that 6t is sufficiently large for many martingale increments 
to occur in this interval. Now, let e^s denote the martingale increments in intervals 6t. 
In order for the variance of increments in 1 unit of time to be uniformly bounded, it is 
necessary and sufficient (see the first Lemma) that -^Var[ek] be uniformly bounded; i.e., 
that Var Ek/ VSt be uniformly bounded. A priori, the limit Z may depend on the particular 
discretization used. For the examples given below, this is not the case, although we have 
not been able to derive it analytically as yet. 

For the remainder of the paper, we limit considerations to Markov processes; i.e., 
p (efcls^^^^'') = p{ek\xk-i; (k — 1)) for each k; here, the possible dependence of the prob- 
abihty density on the step number (see. Section \^ is denoted explicitly. Markov processes 
satisfy the master equation 



W{x;t + St) = j deW {x-e;t)pst{e\{x-e)-t), (11) 

where pst{£\x;t) denotes the probability density function for an increment e to occur in 
time 5t beginning from (x; t). Taylor expanding in the variables t and x about W{x\ t), and 



noting that Var 



e/\/5t 



is bounded, gives the Fokker-Planck equation 



Q^W{x-t) = -—{D{x-t)W{x-t)) 
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(12) 
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x: t 



(13) 



where the diffusion coefficient D{x;t) is given by 

D{x]t) = — [ de e'^p{e\x]t) = Var 
dt J 

The derivation assumes the martingale condition E' t] = J de e pst{£\x; t) = 0. 

Observe next that, since the scahng index C = \y the probabihty density for scalable 
martingales can be written as 

Wix;t) = ^Fiu), (14) 



where u = x/y/t, and the pre-factor l/yt has been included in order that W{x;t) be 
normalized {i.e., J dxW{x;t) be time-independent). Only certain forms of D{x;t) can be 
consistent with this requirement. In order to obtain them, change variables so that D{x; t) = 
D{u;t). Substituting in the Fokker-Planck equation gives 



^ (uF{u)) + ^{D{u;t)F{u))=0, 



du 



which can be integrated to 



d 



uF{u) + — {D{u;t)F{u)) = ci(t). 



Here ci(t) is the "constant" of integration. Integrating a second time gives 



D{u-t) 



dv vFiv) + 



{ci{t)u + C2{t)) . 



(15) 



(16) 



(17) 



F{u) ^ ^ F{u) 

where C2(t) is the second constant of integration. 

Examples: The Gaussian distribution F{u) = exp (— ^m^) corresponds to D{u;t) = 1 + 
{uci{t) + C2(t)) exp {^u^)- The exponential distribution F{u) = exp {—\u\) corresponds to a 
diffusion coefficient D{u; t) = {1 + \u\) + {uci(t) + C2(t)) exp {~\u\). 

Note that the terms in D{u;t) that contain Ci(t) and C2(t) do not change the form of 
W{x;t). Hence, they will not be considered in the remainder of the paper; i.e., only the 
t-independent part of D{u;t), henceforth denoted D{u), will be considered. 

Conversely, if the diffusion coefficient D{u) is given, Eqn. ()15|) can be integrated to give 



Fiu) 



Du) 



exp \ — dv 



Dv) 



dv exp ( / dw 



w 



D{w) 



+ 02 



(18) 



where Oi and 02 are constants of integration. If D{u) is symmetric under reflections about 
the origin and the process begins at Xq = then F{u) is symmetric 3|; consequently Oi = 0, 
as can be seen from the anti-symmetry of the left side of Eqn. (|T6|) . Then, 



F{u) 



D{u) 



exp 



dv 



D{v) 



(19) 



The form of F{u) for selected diffusion rates is given next. As mentioned in Sec- 
tion ini although Var 

Var 



for each n is uniformly bounded, the conditional variance 
En/ V 5t\x] t of the martingale differences, given by D{u), is not required to be bounded 
with respect to u. 
Examples: 

(I) D{u) = 1 — > F{u) = exp (-1^2) 
(II) D{u) = 1 + a\u\ ^ F{u) = exp / (1 + 

(III) D{u) = 1 + |m| — > F{u) = exp {-\u\) 

(IV) D{u) = (1 + au^) F{u) = (1 + aM2)-(i+(i/2a)) 

Thus, suitable choices of D{u) can give exponential or power-law behavior in F{u). Note 
that, in the final example a < 1 is needed in order for the condition pO|) to be satisfied. 

We have confirmed numerically that stochastic dynamics with diffusion coefficients given 
in these examples give probability density functions consistent with the analytically derived 
expressions. These computations were conducted by integrating the (zero-drift) Langevin 
equation dX = [D(X(t); t)]^/^A/'(0, dt) [2^. The integrations are done using Ito calculus; 
i.e., it is assumed that each step in the integration consists of a large number of stochastic in- 
crements and that variations in D{X{t); t) during the interval can be ignored. Consequently, 
the deviations in a time interval 5t lie on A/'(0, \/D5t). 



IV. DISTINGUISHING BETWEEN LEVY AND VARIABLE DIFFUSION PRO- 
CESSES 

Given a stochastic process {Ek} such that Xn lies on a scalable, non- normal distribution, is 
it possible to determine if Levy or variable diffusion processes are the more likely source of the 
dynamics? More precisely, is it possible to eliminate one of the scenarios as the underlying 
cause of the observed stochastic dynamics? One possible criterion is to test if the variance of 
the stochastic process is finite (variable diffusion) or infinite (Levy). However, it is difficult 
to make this determination from a finite time series. An alternative is to use the fact that 
while successive movements of a Levy process are independent, those in the variable diffusion 
case depend on the location and time of the walk. For example, if D{u) increases with |m| (as 
in the examples above), then large movements are likely to leave x„ (and hence D{u)) large; 
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FIG. 1: Examples of 10,000 steps from a (a) Levy distribution with = 2/3 and (b) variable 
diffusion process with D{u) = 2(1 + n^). Unlike in (a), a large fluctuation in (b) is generally 
followed by movements with higher amplitude. 

consequently, a large fluctuation can generally be expected to be followed by additional 
(positive or negative) large increments. Levy processes with independent increments will 
not exhibit such correlations. Dynamics of Levy and variable diffusion processes, shown in 
Figure m illustrates the difference. 

Thus, one may consider distinguishing variable diffusion and Levy processes using the 
auto-correlation function of a time series. However, since the mean value of the increments 
is zero in for each case (since they are martingales), the auto-correlation will vanish. On 
the other hand, auto-correlation function of {s'l}^'^^ will only vanish for the Levy case. 
Specifically, for a random time series of length n, we use 

1 



C(m; n) 



Var fe^l 



(20) 



where (.) denotes the average over k. For Levy processes, C{m; n) vanishes for m > 0, while 
for variable diffusion processes with D{u) = 1 + 1^1, it is found to decay as exp{—am/n); 
the n-dependence implies that a longer series contains larger fluctuations. 

For fluctuations in flnancial markets, C(m; n) is known to exhibit a slow decay with m 



22| | . This phenomenon, referred to as "clustering of volatility," suggests that Levy processes 



are unlikely to be the source of scalable non-Gaussian distributions in flnancial markets. 
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V. DISCUSSION 



The theory we have presented is not merely a reformulation where an observed scal- 
able probability density function W{x; t) is recast into a suitably chosen diffusion coefficient 
D{x;t). Rather, it introduces a new class of stochastic dynamics. Unlike Levy processes, 
the increments considered in our work, although Markovian, are not independent. In addi- 
tion, they have finite variances. The scaling index for scalable diffusion processes takes a 
unique value C = |- The probability density function W{x; t) for continuous time stochastic 
dynamics takes the form -^F{u) and satisfies the Fokker-Planck equation. The diffusion 
coefficient can be chosen to be a function of u, and there is a correspondence between F{u) 
and the diffusion coefficient D{u). 

The fact that successive events are independent in Levy processes and only martingales in 
our variable diffusion processes implies that dynamics can be used to identify which model 
is more suitable to represent a given time series of stochastic events. We propose the use of 
the auto-correlation of e^'s as such a test. Previous studies of financial markets suggest that 
they consist of increments that are not independent, and hence suggest that independent 
Levy processes are unlikely to be the correct explanation for the observed non-Gaussian 



probability density functions 



221. 



The need for x-dependent diffusion coefficients implies that the stochastic dynamics is 
not invariant under translations in x. In particular, for the examples given earlier, the origin 
is both the starting point of the walk as well as the location where D{x\ t) is minimized. In 
financial markets, one does expect any sudden large fluctuation in the price of a stock to 
be followed by a period of high anxiety in the part of traders; consequently the stock can 
be expected to trade at a significantly higher rate. This is equivalent to an increase in the 
diffusion rate. However, if the price of the stock settles at this new value, it is likely that the 
location of the minimum in D{x]t) will move towards it. Thus, a more realistic model of 
financial markets would involve a coupled variation of the price of the stock and the location 
of the minimum of the diffusion coefficient j^] • 

A time-dependent, but x-independent drift of the stochastic process can be intro- 
duced by including a "drift" term — fi{t)W {x; t) on the right side of the Fokker-Planck 
equation [2^. Redefining m to be — fi{s)ds^ gives Eqn. (fT2j) . and the rest of the 

analysis presented here follows. 
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APPENDIX A: ANOMALOUS MARTINGALE PROCESSES 

When the diffusion coefficient is a function of m, the martingale sums may fail to lie on 
a normal distribution. We have chosen processes where E [le^p"'"'^] is uniformly bounded, 
so that conditions (1) and (3) of the martingale CLT are satisfied. Hence, the random 
variable Z is not distributed normally because {^/nj^ej, does not approach a constant (in 
probability) for large n. We illustrate this failure with two examples of discrete random 
walks. 

The distribution of 7f for a finite-step martingale with D{u) = 1 + Im] is shown in Figure 
I2fa). Since D{u) > 1 for all u, rj^ is non- vanishing only when the argument is larger 
than 1, where it decays exponentially. As expected from the analysis, F{u) is found to be 
\exp (— |m|). 

Next, consider a martingale with D{u) = {1 + tanh \u\). For a fixed t, D{u) varies 
between 1 and 2, and for a fixed x, it reduces to 1 with increasing t. The histogram of rf, 
computed numerically for a set of 100,000 random walks of length 100,000, converges to the 
function shown in Figure Efb). Since 1 < D{u) < 2, r/^ is non-zero only in the interval [1, 2]. 
The corresponding probability density function W{x;t) has the form ■^F{u), but F{u) 
is not Gaussian. In contrast, if the diffusion coefficient is chosen to be {1 + tanh \x\) or 
[l + tanh (l/Vt)), rf is found to be constant, and W{x:, t) is found to approach a Gaussian. 
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